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Abstract 



Over the past decade, advances in super-resolution microscopy and particle-based modeling have 
driven an intense interest in investigating spatial heterogeneity at the level of single molecules in cells. 
Remarkably, it is becoming clear that spatiotemporal correlations between just a few molecules can have 
profound effects on the signaling behavior of the entire cell. While such correlations are often explicitly 
imposed by molecular structures such as rafts, clusters, or scaffolds, they also arise intrinsically, due 
strictly to the small numbers of molecules involved, the finite speed of diffusion, and the effects of 
macromolecular crowding. In this chapter we review examples of both explicitly imposed and intrinsic 
correlations, focusing on the mechanisms by which microscopic heterogeneity is amplified to macroscopic 



It has long been understood that cells perform key functions by exploiting or actively maintaining hetero- 
geneous spatial structures. Many of these heterogeneities exist at the length scale of the cell or larger. For 
example, embryonic segmentation is guided by the formation of protein gradients that stretch from one end 
of the embryo to the other [T] . Bacterial cell division is aided by the periodic localization of antagonistic pro- 
teins to either cell pole [5] . These examples are characterized by the intuitive feature that a space-dependent 
function is performed using a spatially heterogeneous process (Fig. nK). 




Recent years have seen the discovery of more subtle processes that are homogeneous on cellular scales 
but nonetheless heterogeneous on molecular scales. These processes share the remarkable property that 
microscopic heterogeneities at the level of several molecules can have macroscopic functional consequences 
at the level of the entire cell. These heterogeneities are often rooted in molecular-scale structures, such as 
cytoskeletal partitions O H] or lipid domains |S] on the plasma membrane and molecular clusters [3 [S] , 
scaffold proteins "9^ or other molecular complexes (Fig. [lj3). The ability for such small spatial structures 
to induce cell- wide changes relies on an intricate network of signaling interactions, such that molecular 
spatiotemporal correlations can be amplified to produce a macroscopic response. In this respect, microscopic 
heterogeneities are functionally important not for their spatial structure per se, but as an additional degree 
of freedom in signaling computations |10| . 

While molecular structures explicitly impose microscopic heterogeneity, recent studies have revealed that 
microscopic heterogeneity also arises intrinsically, due to spatiotemporal correlations between individual 
molecules. Intrinsic heterogeneity is strictly a consequence of small numbers of molecules, the finite speed 
of diffusion, and the crowded intracellular environment, but when coupled to a nontrivial signaling network, 
it can have dramatic effects at the cellular level. The predominant mechanism by which spatiotemporal 
correlations are introduced is via a rebinding process, whereby one molecule rapidly rebinds to another 
before diffusion can take it far enough away to effectively rejoin the pool of other molecules (Fig. [ip). In 
some cases, the effect of these rebinding events can be captured by renormalizing the parameters governing 
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a system that is otherwise treated as well mixed. In other cases, however, these rebinding events can place 
the system in a new dynamic regime or enable a new signaling pathway, leading to a cellular response that 
is qualitatively different than that of a well-mixed system. The effects of microscopic heterogeneity are 
therefore critically related to the signaling network that underlies the molecular interactions. 

From a theoretical perspective, it has become clear that the different scales of spatial heterogeneity require 
different types of models. Heterogeneity at the cellular scale and larger is well captured by reaction-diffusion 
models, which treat molecular concentrations as continuous variables that vary in space and time [TUTT]. 
Heterogeneity at the molecular scale, on the other hand, demands probabilistic models, which respect the 
fact that molecules exist in integer numbers. Spatial variation is then typically treated in one of two ways. 
Several methods have been developed for simulating the reaction-diffusion master equation [121 113j , which 
divides the system into compartments in which molecules are taken to be well mixed. Alternatively, so-called 
particle-based modeling techniques, such as lattice modeling [Tl], Brownian dynamics [T51[TH], or the exact 
Green's Function Reaction Dynamics (GFRD) [TT, track the position and time of every molecule in the 
system. Importantly, spatiotcmporal correlations between individual molecules are often only captured by 
particle-based models, even though they can produce dramatic effects at cellular scales, where one might 
naively deem a reaction-diffusion model appropriate [141 117] . 

In this chapter, we focus on the macroscopic effects of microscopic heterogeneity, reviewing recently studied 
systems in which heterogeneities at the molecular level give rise to nontrivial effects at the cellular level. We 
first discuss several systems that exploit molecular structures, such as microdomains, clusters, and scaffolds, 
to enhance or otherwise change an input-output response. We then describe several examples of systems 
that are macroscopically homogeneous but whose response is nonetheless affected both quantitatively and 
qualitatively by molecular rebinding events. Throughout, we highlight the importance of both particle-based 
modeling and experimental techniques with molecular resolution. The examples presented in this chapter 
reflect the past decade's increased interest in spatial heterogeneity at the molecular scale. 



Molecular structures modulate cellular responses 

The most straightforward way in which cells maintain molecular heterogeneity is via long-lived molecular 
structures, such as cytoskeletal compartments, molecular clusters or macromolecules, and scaffolding pro- 
teins. Many structures that play an active role in signaling are localized to the plasma membrane of the cell, 
as the membrane provides the entry point for detection of environmental stimuli. In fact, the membrane itself 
is highly dynamic and complex [T5], such that membrane heterogeneities modulate cellular signals before 
they are even relayed to the inside of the cell for further processing. 



Membrane heterogeneity: signal modulation at its entry point 

An important example of membrane heterogeneity involves the interaction of the membrane with its under- 
lying actin skeleton. The skeleton provides a network of "fences" that is anchored by transmembrane protein 
"pickets," producing a grid of compartments in which membrane-bound proteins are transiently trapped [3] 
(Fig.[l^, left). The presence of these compartments is inferred from single-molecule tracking experiments, 
which show periods of simple Brownian diffusion interrupted by sudden large hops [191 120| . At 50—300 
nm wide, these compartments are only ten or a hundred times larger than the molecules they trap, and 
because the time to diffuse through them (^150 fis) is two orders of magnitude faster than the residence 
time within them (~15 ms), they act as temporary reaction chambers |19| . Such transient trapping has 
nontrivial effects on the properties of the propagated signal. For example, trapping results in less frequent 
collisions between molecules in different compartments, but more frequent collisions between molecules in 
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Figure 1: Spatial heterogeneity is prominent at both macroscopic and microscopic scales in cellular systems. 
(A) Macroscopic heterogeneity: across many cells in an embryo, protein gradients guide segmentation (left); 
while within single bacterial cells, an oscillating protein gradient guides division (right). (B) Microscopic het- 
erogeneity: molecular structures, including membrane partitions (left), membrane microdomains (middle), 
and scaffold proteins (right), introduce microscopic heterogeneities that can have cell- wide effects. (C) Spa- 
tiotemporal correlations resulting from rapid rebinding events can also have pronounced effects on cellular 
responses. Here, ligand molecules rapidly rebind to a receptor (left), a transcription factor rapidly rebinds to 
a DNA operator site, outcompeting an RNA polymerase (middle) ; and a substrate molecule rapidly rebinds 
to a kinase to become doubly phosphorylated (right). 



the same compartment, such that diffusion-limited reactions occur in rare but potent bursts |21) . In a similar 
way, trapping enhances oligomerization, which is then further enhanced by the fact that the probability to 
hop to a neighboring compartment decreases with molecule size I18|. Thus, signals that are initiated by 
association reactions or oligomer formation can be highly modulated by the presence of compartments on 
the membrane. 

The composition of the membrane is also richly heterogeneous because the lipids that comprise the membrane 
are not completely mixed. Instead, regions enriched in glycosphingolipids and membrane proteins, often 
called lipid rafts or microdomains, transiently assemble and float within the surrounding lipid bilayer [5l[6]. 
Indeed, super-resolution microscopy has uncovered complexes of glycosphingolipid and cholesterol existing 
on ~10— 20 ms timescales and <20 nm length scales [35]. Rafts are thought to serve as platforms for signal 
transduction |23| . In particular, they have been connected to the observation that certain membrane-bound 
proteins form clusters in a cholesterol dependent manner [21]. For example, immuno-electron microscopy 
experiments have revealed that Ras, a membrane-bound eukaryotic protein implicated in a variety of pheno- 
typic responses including oncogenesis, forms cholesterol-dependent clusters of roughly seven molecules [711^ 
(Fig. [1J3, middle). Single-molecule tracking experiments confirm that Ras molecules exhibit suppressed dif- 
fusion upon activation [55J[57], suggesting further that clustering correlates with molecules' activation state 
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[25] , Ras plays a critical role in transducing extracellular signals: once activated in response to receptor 
stimulation, Ras initiates a mitogen-activated protein kinase (MAPK) cascade that ultimately leads to tran- 
scription. Theoretical studies have shown that the organization of Ras in clusters improves the fidelity of 
this transduction by converting a graded response to a digital one [53] and thereby reducing the intrinsic 
noise [3^. The discrete nature of the molecular structure therefore translates to a signaling advantage at 
the level of the entire cell. 



Clusters and scaffolds: competing effects on mean responses 

The discovery of molecular clusters on eukaryotic membranes resonates with the long standing observation 
that the receptors that initiate chemotaxis form clusters on bacterial membranes [8]. The prevalence of 
membrane clustering has thus launched considerable efforts to quantify its effect on signal detection, ampli- 
fication, and subsequent propagation. With regard to detection, for example, it has been recognized that 
receptor clustering promotes the repeated reversible binding of a ligand molecule, a phenomenon termed 
serial ligation. Serial ligation was first studied in the context of T cell activation, where it was proposed to 
account for the high sensitivity of T cells to low numbers of ligand molecules [311 132j . Brownian dynamics 
simulations have been used more recently to investigate the role of serial ligation for clustered receptors, 
revealing that rather than amplifying or attenuating signals on average, repeated ligand binding affects the 
statistics; localized detection events occur in rapid succession, with long intervals in between ^33j. Such 
spatiotemporal correlations indeed have the potential to boost the sensitivity to low ligand concentrations, 
at the cost of increasing the noise in the time-averaged detected signal |33) . 

With regard to amplification, it has been demonstrated both theoretically and experimentally that incoming 
signals can be amplified by coupling of receptors to each other within a cluster. Such coupling, in which the 
ligand-induced activation of one receptor biases the activation of neighboring receptors, has been modeled 
using techniques from statistical mechanics, originally by Monod, Wyman, and Changeux [34] and Koshland, 
Nemethy, and Filmer [3S] , and more recently by Bray and Duke [M] • Indeed, modeling [371 EH] and exper- 
iments in bacteria [39j have shown that receptors operate in a highly cooperative manner, mimicking the 
behavior of allosteric proteins, ultimately leading to enhanced sensitivity to the incoming ligand signal. 

Finally, protein clustering not only allows amplification of the incoming signal, it also changes the propagation 
of the signal from the receptors at the membrane to the downstream signaling pathway in the cytoplasm. In 
the case of both the Ras system and the bacterial chemotaxis system, the clustered proteins drive a covalent 
modification network often called the push-pull network, a motif that is also ubiquitous in intracellular 
signaling [40^ . In a push-pull network, a substrate is activated and deactivated by two antagonistic enzymes 
(Fig. [2^, inset). Lattice simulations have shown that localizing the activating enzyme to clusters on the 
membrane leads to competing effects which can either reduce or enhance the mean response [Hj . On the 
one hand, clustering the activating enzyme molecules presents a smaller target to a substrate molecule in 
the bulk. Clustering thus increases the average time for a substrate molecule to find an activating enzyme 
molecule by diffusive search, an effect that can reduce the response (Fig. [2^3). On the other hand, clustering 
promotes rapid rebinding of a substrate molecule to an activating enzyme or its neighbors. In a double- 
modification push-pull network (Fig. [2p, inset) in which the substrate molecules must rebind to an enzyme 
to become doubly activated (a so-called distributive mechanism), rapid rebinding can overcome the effect 
of the target size reduction and instead enhance the response (Fig. ^p). This example shows explicitly that 
the effects of molecular heterogeneity are critically dependent on the underlying signaling network. 

It is important to emphasize that signal detection by the reversible binding of a ligand molecule to receptors 
constitutes an equilibrium process. Changing the spatial distribution of receptors therefore changes the 
statistics of the response but leaves the mean unaffected. The reason is that the mean occupancy follows 
directly from the partition function for an equilibrium process, which in this case is independent of spatial 
configuration _14J . In contrast, signal propagation is quite often a non-equilibrium process. Indeed, Fig. [2] 
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Figure 2: Membrane clustering can reduce or enhance a signaling response. (A) A substrate (cyan) is 
covalently modified by an activating enzyme (green) and a deactivating enzyme (magenta). Activating 
enzyme molecules are either well-mixed in the bulk (left), localized to the membrane randomly (middle), 
or clustered on the membrane (right). (B) In a single-modification network, the response is reduced with 
clustering, since clustering reduces the target size as seen from the bulk. (C) In a double-modification 
network, on the other hand, the response can be enhanced with clustering, since clustering promotes rapid 
rebinding of substrate molecules. The output is normalized by the maximum value of the well-mixed curve. 
Data arc from Fig. 2 of [14,; cluster size is 25 molecules. 



demonstrates that when molecular heterogeneity is coupled to a non-equilibrium network such as the push- 
pull network, the mean response changes |141 141] . Downstream signaling from membrane clusters therefore 
provides a key example of how molecular heterogeneity can modulate mean cellular responses when coupled 
to a non-equilibrium signaling network. 

The observation that molecular structures can have competing effects on mean responses has also been 
made with regard to scaffolding proteins. Scaffolds are docking proteins that often serve to assemble the 
appropriate molecular components of signaling networks (Fig. [Tj3, right), including MAPK cascades [I^ [5]. 
Lattice Monte Carlo simulations of a MAPK cascade have shown that scaffolds amplify signals that would 
otherwise be attenuated or attenuate signals that would otherwise be amplified [13]. Amplification occurs 
when phosphatase activity is high and therefore the active state of the kinase is short-lived. In this case, 
scaffolds promote the rapid phosphorylation of downstream kinases that neighbor each other on the scaffold, 
thus preserving a signal that in solution would be attenuated by the phosphatases. Conversely, attenuation 
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occurs when phosphatase activity is low and therefore the active state of the kinase is long-hved. In this 
case, scaffolds limit the number of downstream molecules that a kinase can phosphorylate to those that are 
bound to the scaffold, whereas the kinase could instead phosphorylate many more molecules by free diffusion. 
The opposing effects illustrate that the properties conferred to a signaling network by a molecular structure 
depend nontrivially on the biochemical parameters, a feature that is also true for signals stemming from 
membrane clusters [14) . 



Macromolecules: the effect of dimensionality 

While the membrane imposes a roughly two-dimensional geometry, many macromolecules and polymers in- 
side cells, such as DNA, microtubules, and actin filaments, are quasi-linear and thus impose a one-dimensional 
geometry. Since the properties of Brownian diffusion are critically dependent on dimension [44' , many have 
wondered if the linearity of macromolecules plays a significant role in the signaling computations in which 
they take part. The question of dimensionality and signaling is perhaps best exemplified by the problem of a 
transcription factor protein locating the operator site on a strand of DNA in order to regulate the expression 
of a gene. The search can occur by a process known as facilitated diffusion, in which three-dimensional (3D) 
diffusive motion in the cytoplasm is combined with periods of one-dimensional (ID) diffusive sliding along 
the DNA gg. 

While it would seem that a combined 1D-3D search strategy might have very different statistical properties 
than 3D diffusion alone, recent work has revealed that, surprisingly, this is not necessarily the case. For 
example, one may consider the mean time for a protein to find the operator site, which is approximately given 
by (L/A)(A^/Di -I- r^/Da) [46]. Here L is the total length of the DNA, A is the average distance over which 
the protein slides along the DNA before it dissociates, r is the characteristic distance in 3D space between 
two segments on the DNA, Di is the diffusion coefficient for sliding, and D^, is the diffusion coefficient in 
the cytoplasm. This expression has a clear interpretation [35]: A^/Di is the sliding time, /D^ is the time 
spent on 3D diffusion, the sum of these terms is thus the time to perform one round of sliding and diffusion, 
and L/X is the total number of rounds needed to find the target. One immediately sees that the search time 
is minimized when A = r^jDi/D^, corresponding to equal time spent sliding in ID and diffusing in 3D. A 
key insight, however, is that on length scales beyond the mean sliding length, the motion essentially appears 
as 3D diffusion Wf]. In fact, theoretical estimates [3H] and recent experiments [321 HO] show that the sliding 
length is quite short (A < 15 nm), comparable to the typical size of the protein itself. In the end, then, 
facilitated diffusion appears very similar to 3D diffusion alone on the length and timescales that are relevant 
for the behavior of the network. 

One might further expect that facilitated diffusion would reduce the noise in gene expression caused by the 
diffusive arrival of transcription factors. This expectation stems from the intuition that noise should decrease 
with the size of the target and that permitting ID diffusion should dramatically increase the effective target 
size [3S]. However, recent theoretical work has shown that that this intuition is incorrect; instead, the noise 
from facilitated diffusion is roughly equal to the noise from 3D diffusion alone [51 . The reason is that 
the increased size of the target is largely canceled by the strong temporal correlations characteristic of ID 
diffusion. Specifically, the variance scales as 1/T in 3D but in ID, such that integrating over a fixed 

time T gives fewer independent samples in ID than in 3D [5T]. Thus, with respect to the noise as well as 
the mean, the dimensionality reduction introduced by quasi-linear macromolecules can have effects that are 
far from straightforward. 
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Spatiotemporal correlations modulate responses even in homoge- 
neous systems 



When we call a system "well mixed," we often imagine a chemical reaction in a test tube. In a test tube, the 
number of molecules involved is huge, allowing concentrations to be treated as continuous variables. Diffusion 
(or active stirring) then creates an environment in which the probability of reacting is proportional to the 
bulk concentrations of reactants. In cells, however, reactions can be far from well mixed. The number of 
molecules of a specific type can be in the tens or hundreds (or even one, in the case of, say, a DNA operator 
site). The diffusion of a handful of molecules then can no longer be treated as creating a homogeneous 
system. Instead, spatiotemporal correlations between individual molecules give rise to heterogeneity. This 
heterogeneity is quite distinct from that explicitly imposed by actively maintained molecular structures. 
Indeed, it arises strictly due to the small number of molecules and the finite speed of diffusion. Nonetheless, 
just as with molecular structures, correlations caused by this inherent heterogeneity can be amplified by a 
signaling network, giving rise to appreciable changes over the entire cell. 



Rapid rebinding 

Under the assumption that a system is well mixed, the probability that a molecule reacts within any short 
time window is constant, proportional to the bulk concentration of its reaction partner(s). A direct conse- 
quence of this fact is that the waiting time between reaction events is an exponentially distributed random 
variable. In reality, however, molecules must diffuse toward each other in order to react. The probability 
of reacting with a particular partner molecule can be heightened simply by the fact that, by virtue of a 
recent unbinding event or a random diffusive fluctuation, it is nearby. The result is that a system cannot 
be well mixed on all timescales. While on sufficiently long timescales the waiting time will be exponentially 
distributed, on short timescales it will follow a distribution whose form will include algebraic scalings typ- 
ical of diffusion-mediated interactions. This observation immediately identifies the rebinding problem — i.e. 
measuring the waiting times between successive binding events — as a key tool in exploring the continuum 
between rapid behavior that is spatiotemporally correlated and long-time behavior that is effectively well 
mixed. 

The rebinding problem is most simply posed by considering a single A molecule that can react with one of 
many freely diffusing B molecules in fixed volume according to the reversible reaction A + B ^ C. This 
problem can be modeled in a spatially resolved way using particle-based modeling techniques; results from a 
simulation with GFRD [17] are shown in Fig. [sj^. Interestingly, both exponential and algebraic regimes are 
observed in the distribution of rebinding times t. At long times, the distribution is exponential, characteristic 
of a well-mixed system. The exponential dependence corresponds to events in which a B molecule unbinds 
from the A molecule and diffuses far enough away to be effectively randomized within the bulk of other B 
molecules. At short times, the distribution is algebraic, following a scaling at the shortest times, and a 
scaling at intermediate times. The algebraic dependence corresponds to events in which a B molecule 
unbinds from the A molecule but remains in the vicinity and rebinds before escaping to the bulk — so-called 
rapid rebinding events. The shortest times correspond to collision-dominated excursions: these extremely 
local (on the order of the molecular diameter) excursions are dominated by many unsuccessful reflections 
and thus inherit the scaling from the Gaussian Green's function of a particle freely diffusing in one 

dimension [TJ]. Intermediate times correspond to search-dominated excursions: these longer excursions find 
the B particle returning to an effectively absorbing boundary, producing the t~^^'^ scaling characteristic of a 
three-dimensional random walker returning to an absorbing origin }14j . Importantly, Fig. [3]A_ demonstrates 
that the exponential portion of the rebinding time distribution is a function of the concentration of B 
molecules, as one would expect for a well-mixed system. In contrast, the algebraic portion is independent of 
concentration because it is a local feature arising from the interaction between just two molecules. 
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Figure 3: The distribution of rebinding times reveals departures from a well-mixed system. (A) Rebinding 
in three dimensions. A single A molecule binds one of 10 B molecules via A + B ^ C . The concentration 
[B\ is controlled by changing the volume. The concentration-independent algebraic scalings are due to the 
rebinding of the same B molecule, whereas the concentration-dependent exponential decay is due to the 
arrival of a i? molecule from the bulk and agrees with the well-mixed prediction. Data are from Fig. 2 of 
[TT] . (B) Rebinding to clusters on a plane. A cluster of 100 A molecules on a plane binds a single B molecule 
in the three-dimensional bulk via A + B ^ C . The algebraic scalings in the molecular and planar regimes 
are due to the rebinding of the B molecule to the same cluster and to neighboring clusters (via periodic 
boundaries), respectively. The exponential decay in the bulk regime is due to the B molecule reflecting off 
the wall opposite the plane, effectively randomizing its position and newly arriving from the bulk, as in a 
well-mixed system. Parameters are as in Fig. 4 of [14], except C = 6000 and (5 = 5; time is scaled by the 
typical time to diffuse one molecular diameter, jD. 
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The rebinding time distribution is also a useful tool for understanding the spatiotemporal effects of molecular 
structures, such as the membrane clusters depicted in Fig. |2j\ (right). Analytic tools [SH [531 [SI] , Brow- 
nian dynamics simulations (33^ and lattice simulations [14, _53J have been used to study the distribution 
of times for a particle to rebind to a set of clusters on a plane. Fig. |3j3 shows that such a distribution 
also exhibits exponential dependence at long times (the bulk regime) and algebraic scalings at short times. 
Here, however, the algebraic regime exhibits two subregimes. The shortest times (the molecular regime) 
correspond to rebinding to the same cluster, producing the same and f"'^/^ scalings seen in Fig. [sj^ 

from collision-dominated and search-dominated excursions, respectively [14j . Intermediate times (the planar 
regime) correspond to excursions in which the particle diffuses far enough away that the granularity of the 
clusters disappears, and the plane appears to be uniformly covered with the activating enzyme. This regime 
also exhibits and scalings, corresponding once more to collision-dominated and search-dominated 

returns, respectively [14] . Importantly, the weight of the distribution within the molecular regime is the key 
determinant of the strength of rebinding and therefore underlies the enhancement of the mean response for 
the double-phosphorylation network seen in Fig.[2|Z! (14] , 



Renormalization: integrating out the rebinding 



The fact that rebinding events are so rapid when compared to the timescale of bulk redistribution raises the 
question of whether rebinding can be captured in a more coarse-grained description |481 155| . That is, it is 
natural to ask whether rebinding can be absorbed into the description of an effectively well-mixed system 
by a simple rescaling of the binding and unbinding rates. Indeed, van Zon et al. showed [48j that in the 
context of gene regulation, the effect of the diffusion and rebinding of transcription factor proteins to a DNA 
operator site can be captured in a well-mixed model by renormalization of the rates as follows: 
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(1) 

fcoff = (t^ + ^I • (2) 



Here, kd is the dissociation rate of the protein from the operator site, ka is the intrinsic association rate 
(i.e. the rate of binding given that the protein is in contact with the operator site), ku is the diffusion- 
limited association rate (i.e. the rate of a protein finding the operator site from the bulk; in three dimensions 
ko — AnaD for binding cross-section cr), and Kcq = ka/kd is the equilibrium constant. The rates fcon 
and kos are not the intrinsic association and dissociation rates. Instead, fcon and fcoff are effective rates, 
accounting both for the intrinsic activity and for diffusion. Thus, the result naturally suggests a coarse- 
grained interpretation of binding as an effectively two-state system, so long as we properly renormalize the 
binding and unbinding rates as in Eqns. [T]and[2] 

Interestingly, Eqns. [T] and [2] have a clear interpretation in terms of the number of rapid rebinding events |55) . 
The probability of rebinding n times, then exiting to the bulk, is (1 — p)"'p, where p = ku/^ka + fcn) is the 
probability of escaping a reactive origin in three dimensions j48j . The average number of rebinding events is 
then N = X^^o "(-'^ ^pY^P = (1 ^P) /P = ka/ko. Assuming the time to rebind is short compared to the time 
spent bound fc^^, the total time spent bound can be estimated as (1 -I- iV)fc^^ — kd^ + K^q/ko = k~g. We 
thus arrive at a very useful interpretation of Eqn.[2j fcoff is the coarse-grained dissociation rate, renormalized 
by averaging over the rapid rebinding events. Because reversible binding is an equilibrium process and 
therefore obeys the detailed-balance condition K^q = ka/kd = fcon/fcofi, it follows that the coarse-grained 
association rate must be renormalized by the same factor (1-|- A'^). Indeed, renormalizing, we reproduce Eqn. 
[1} (1-1- N)k^^ = 1/fc, + 1/kn - fco-^ 

The validity of renormalization was demonstrated by showing that this approach not only captures the mean 
but also the noise in downstream transcription and translation. Particle-based simulations with GFRD [IS] 



9 



were used to study the dynamics of a transcription factor that represses transcription by binding to the DNA 
operator site, thus interfering with the binding of the RNA polymerase (Fig.[lp, middle). The effective bound 
time of the repressor is extended by rapid rebinding events, which in this case can dramatically increase the 
noise in the production of mRNAs and proteins. Because these rebinding events occur on a much faster 
timescale than the binding and unbinding of the RNA polymerase, groups of rebinding events can safely 
be treated as single binding events in a well-mixed description with the appropriately renormalized rates. 
Simulation of transcription and translation using a well-mixed model with these renormalized rates then 
indeed reproduces with high accuracy the noise in the numbers of mRNAs and proteins (Fig. |4j'V and B). 
Renormalization is thus a powerful tool in this context, not only providing a simplified description of the 
rapid rebinding process but also yielding accurate statistics for its downstream effects. 

The treatment of rebinding in a renormalized description has more recently seen favorable comparison with 
experiments on T cell receptors [SSJ [57] . A long-standing puzzle in the field of T cell ligand potency was 
whether the response of a T cell is governed by how long a ligand molecule stays bound to a receptor, or how 
often receptors are occupied by ligand molecules on average. Eqn. [2] illustrates that the effective bound time, 
once renormalized for rebinding, actually depends on both. That is, the effective bound time fc^^ depends on 
both the intrinsic bound time fcj^ and the equilibrium constant K^q, which dictates the occupancy. Indeed, 
using the two-dimensional analog of Eqn. [2] (to account for the planar nature of the membrane), several 
recent studies [SHI EZj have shown that potency data best support a model in which T cells respond to the 
effective bound time, instead of to the intrinsic bound time or the occupancy alone. 

Additionally, a renormalized description of rebinding has quite recently been invoked to explain the observed 
kinetics of Src homology 2 proteins binding to receptor tyrosine kinases at the membranes of human cells [5S] . 
The authors observed a ^20-fold decrease in the protein dissociation rate relative to in vitro measurements, 
which they explained via a reduced effective off-rate of the form of Eqn. [2] finding that a typical protein 
locally rebinds to a receptor ^20 times. Furthermore, the authors confirmed that the effective off-rate 
is dependent on the diffusion constant of the proteins in the cytosol (which explicitly enters Eqn. [2] via 
k]j). Interestingly, they also observed that the receptors form clusters upon stimulation, further enhancing 
rebinding, which indeed underscores the intimate link between clustering and rebinding discussed above. 



Beyond renormalization: when rebinding causes new behavior 

The success of the renormalized description of rebinding relies strongly on the presence of a timescale sepa- 
ration. For example, when applying the description to gene regulation in the previous section, it was critical 
that the rebinding time of the repressor is much faster than the association time of the RNA polymerase. 
This timescale separation allows one to treat a group of rapid rebinding events as a single event and to 
model the downstream dynamics accordingly. Importantly, however, not all systems prone to rapid rebind- 
ing exhibit such a timescale separation. The spatiotemporal correlations induced by rebinding can then no 
longer be captured by simple renormalization; instead they introduce a qualitative change in the signaling 
response. 

A well studied system in which rebinding has been shown to cause qualitative signaling changes is the 
MAPK cascade. In each layer of the MARK cascade, a substrate is phosphorylated and dephosphorylated 
at two sites by a kinase and a phosphatase, respectively. The network is therefore as shown in the inset 
of Fig. [2p, except that here the kinase (Ea), the phosphatase (Ed), and the substrate {S) are all freely 
diffusing in three-dimensional space. In general, two-site modification can be carried out by one of two 
mechanisms. In a processive mechanism, an enzyme modifies both sites on a substrate molecule before 
releasing it. In a distributive mechanism, on the other hand, the enzyme must release the substrate molecule 
in between modification of the two sites. Modeling has revealed that the choice of mechanism has important 
functional consequences. A distributive mechanism generates an ultrasensitive (very sharp) response because 
the concentration of doubly phosphorylated substrate depends quadratically on the kinase concentration 
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Figure 4: Examples in which rebinding can (A, B) and cannot (C, D) be renormahzed. (A, B) A transcription 
factor represses production of mRNA and protein by competing with RNA polymerase for the DNA operator 
site. Spatiotemporal correlations between the repressor and the DNA, as captured by the particle-based 
model, dramatically increase the noise (variance divided by the squared mean) in the mRNA number (A) 
and protein number (B) beyond that of the well-mixed model. Because the timescale of repressor rebinding is 
much faster than that of RNA polymerase binding, the increased noise is well captured by a well-mixed model 
with rates renormahzed by the average number of rebinding events. Data are from Fig. 2 of |48j . (C, D) A 
distributive dual phosphorylation network, as in one layer of a MAPK cascade. As the diffusion coefficient 
decreases, the input-output response becomes less ultrasensitive (i.e. less sharp), ultimately approaching that 
of a processive network (C). In the bistable regime, as the enzyme reactivation time decreases, the system 
goes from bistable to monostable (D). In D, the dashed lines show the two bistable states of the well-mixed 
system, while the blue, green, red, and cyan data correspond to successively larger initial conditions for 
the fraction of doubly phosphorylated substrate. It is seen that at enzyme reactivation times greater than 
~100 /US, simulations proceed to one of two fixed points, indicative of bistability, while at times less than 
'-^lOO fis^ they converge to one monostable state; in between, the system is close to a critical point, with 
concomitant large and long-lived fluctuations. Loss of ultrasensitivity and bistability constitute qualitative 
changes in the system behavior that depart from the well-mixed predictions in ways that cannot be captured 
by renormalization. Data are from Fig. 5 (C) and Fig. 6 (D) of [17] . 
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[521 HO]- Moreover, a distributive mechanism can lead to bistable behavior of the substrate phosphorylation 
level if the enzymes are present in limiting amounts f61j . 



Importantly, these theoretical predictions for the behavior of a distributive network are based on mean-field 
arguments [591 1601 ICT] . in which the system is assumed to be well mixed. To test these arguments, particle- 
based simulations with GFRD were performed on a MAPK pathway [T^, in which it has been shown 
experimentally that kinases [551 (53] and phosphatases [53] can act in a distributive manner. Strikingly, the 
simulations revealed that both ultrasensitivity and bistability can be lost in regimes in which well-mixed 
models predict they should be present. The reason for this loss is rapid rebinding: after a substrate molecule 
is phosphorylated at its first site and released, it is still close to the kinase and can with high probability 
rebind to become doubly phosphorylated before diffusing away (Fig. [TjC, right). In essence, rapid rebinding 
can turn a distributive mechanism into a proccssive one, removing the qualitative behaviors associated with 
distributive modification. The distinction between a distributive and a processive mechanism then becomes 
not whether a substrate molecule remains physically connected to its activating enzyme, but rather whether 
both sites are modified by the same activating enzyme molecule. 

Once the effect of rebinding is understood, it becomes clear that the key parameters governing the loss of 
ultrasensitivity and bistability are the diffusion coefficient and the timescale of enzyme reactivation after 
the substrate is released (which, in the case of a kinase, could reflect the time for ADP to be exchanged 
for ATP [TT]). Specifically, the diffusion coefficient must be sufficiently small that rebinding is favored over 
diffusion into the bulk, and the reactivation time must be sufficiently fast that the enzyme is active once 
rebinding is attempted. Indeed, given a sufficiently fast reactivation time, the input-output response of the 
dual phosphorylation cycle shifts from an ultrasensitive response to a significantly more shallow response as 
the diffusion coefficient is lowered, ultimately approaching the response of a processive mechanism (Fig.BC). 




Furthermore, in the bistable regime, measuring the final fraction of doubly phosphorylated substrate over 
a range of initial conditions reveals that the system shifts from bistable to monostable as the enzyme 
reactivation time is reduced (Fig. |4|l)). This loss of bistability can be understood at a mechanistic level. 
When the reactivation time is high, rapid rebinding is suppressed, and the system behaves like it is well 
mixed. In a well-mixed system, bistability arises due to enzyme sequestration: if the fraction of inactive 
substrate is high, the majority of activating enzyme molecules will be occupied by substrate molecules, such 
that when a substrate molecule gets released in its singly active state, it is more likely to encounter a free 
deactivating enzyme molecule than a free activating enzyme molecule, thereby maintaining the high fraction 
of inactive substrate. A symmetric argument holds for the case when the fraction of doubly active substrate 
is high, thus supporting bistable behavior. However, when the reactivation time is low, rapid rebinding is 
possible. Then, when a substrate molecule gets released in its singly active state, it can rapidly rebind to the 
activating enzyme molecule that just released it, even though there are many more free deactivating enzyme 
molecules in the bulk. Rapid rebinding thus circumvents the mechanism that supports bistability, resulting 
in a monostable system. 

Both functional changes — the loss of ultrasensitivity and the loss of bistability — are direct results of rapid 
rebinding, and as such they are not captured by a well-mixed model. Moreover, they constitute qualitative 
changes and are therefore not describable by simple renormalization. Indeed, both effects arise strictly due 
to molecular spatiotemporal correlations in an otherwise homogeneous system. When coupled to the MAPK 
signaling network, these microscopic correlations induce macroscopic changes for the entire cell. 



When rebinding can be integrated out and when it cannot 

We are now in a position to understand more generally the distinction between processes in which rebinding 
can be captured by renormalization and processes in which rebinding induces a qualitative change (Fig.[5|. 
Renormalization is possible when (i) the state of the system is the same before and after rebinding (Fig. 
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Figure 5: Abstract representation of the effects of rebinding. Dots represent the states of a system, sohd 
arrows represent paths taken by the system in state space (with curved arrows representing rebinding), 
and dashed arrows represent untaken paths. (A) Renormalization is possible when rebinding returns the 
system to the same state and when it does so on a timescale that is faster than that governing the system 
behavior. The system then behaves as if it were well mixed, but with renormalized rates. (B) On the other 
hand, a qualitative change is induced when rebinding takes the system from one state (1) to another state 
(2). Whereas in a well-mixed model the transition from state 1 to state 2 occurs with low probability, in a 
spatially-resolved model it may occur with high probability due to rebinding. For example, in the MAPK 
system, after a kinase has just released its substrate molecule in the singly phosphorylated state (state 1), 
the substrate molecule either can bind a kinase or a phosphatase from the bulk, leading to ultrasensitivity 
or bistability (downward arrow), or it can rebind to the kinase that just released it (state 2), leading to a 
loss of ultrasensitivity or bistability (upward arrow). 

[5j\) and (ii) the timescale of rebinding is faster than the timescale that governs the system dynamics. Both 
features are required in order to treat a group of rebinding events as a single effective event, which is the 
assumption that underlies renormalization. As we have seen, this type of process is exemplified by gene 
regulation: the transcription factor-DNA complex is identical before and after a rebinding event, and the 
rebinding time is much faster than the RNA polymerase association time, making it possible to integrate 
out the rebinding as if the RNA polymerase were not present. On the other hand, a qualitative change 
is induced when rebinding takes the system from one state to another state, in turn accessing a different 
dynamical pathway (Fig. [5^3). Whereas in a well-mixed system this pathway is suppressed, when rapid 
rebinding is possible this pathway is allowed, ultimately leading to behavior that is different from the well- 
mixed prediction. This type of process is exemplified by distributive phosphorylation: after a kinase has just 
released its substrate molecule in the singly phosphorylated state, the substrate molecule either can bind a 
kinase or a phosphatase from the bulk, leading to ultrasensitivity or bistability, or it can rebind to the kinase 
that just released it, leading to a loss of ultrasensitivity or bistability. 



Macromolecular crowding and anomalous diffusion 

The requirement that molecules diffuse toward each other in order to react is compounded in the intracellular 
environment by the high density of other macromolecules such as DNA and proteins that are not involved 
in the reaction, a phenomenon termed molecular crowding [651. Molecular crowding is quite significant 
in cells: the concentration of macromolecules in a prokaryotic cell is typically 300—400 g/1 implying 
that macromolecules occupy up to 20—30% of the cellular volume ^7\. In fact, in eukaryotic cells, the 
nucleus is particularly crowded, containing up to 400 g/1 of chromatin alone |68j . Because crowding can have 
significant effects on the diffusion and effective reaction rates of molecules, crowding can affect cell signaling 
in functionally important ways. 
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Specifically, crowding has two principal effects. First, it enhances the probability for reactants to be in 
the bound state, thereby shifting the equilibrium constant of the reaction. The enhancement is due to an 
effective attraction between molecules which is often called a depletion interaction or an excluded volume 
interaction. The attraction arises directly from the fact that molecules typically take up less total volume in 
the bound state than in the unbound state, which frees up volume for the surrounding crowder molecules, 
thus decreasing the free energy of the system by increasing the total entropy |69j ■ Second, crowding changes 
the dynamics. At long length and timescales (e.g. length scales larger than that of the spatial heterogeneity 
in the system), the diffusion of molecules appears normal (i.e. the mean squared displacement scales 
linearly with time t), but with an effective diffusion coefficient that is reduced by up to several orders of 
magnitude [70]. On short length and timescales, on the other hand, crowding can lead to anomalous diffusion, 
or, more precisely, subdiffusion {A"^ (x t" with a < 1) [7TJ [TH [731 [TJ] . Mechanisms by which crowders induce 
such subdiffusive motion include forcing a molecule to explore its environment densely (i.e. with significant 
oversampling) , such that initial positions take much longer to be forgotten, a type of motion that has been 
termed compact diffusion f74j. 

The anomalous diffusion induced by crowding leads to enhanced rebinding |75| . Indeed, recent analytic 
work has shown that subdiffusion significantly extends the timescales over which rebinding occurs |74l 175) , 
which leads to nontrivial non-exponential waiting time statistics. However, the transition between anomalous 
and normal diffusion occurs on a relatively short timescale. The transition has been measured in vivo, for 
example with quantum dots in the nuclei of eukaryotic cells, to occur on a timescale of approximately 20 
ms [5S]. Given that this timescale is typically faster than the timescales governing gene expression, and 
further that the rebinding of transcription factors does not take the system to a new state (Fig. [5]^), it 
has been argued that the effects of crowding on gene expression can be captured by renormalization of 
the rate constants, with exponential waiting time distributions for association and dissociation [76^. This 
approach was used to study the effects of crowding on gene networks. Specifically, Monte Carlo simulations 
following this argument revealed that the effects of crowding on gene expression are mainly caused by the 
shift of equilibrium constants rather than the slowing of diffusion [76| . On the other hand, in signaling 
networks such as the MAPK network, in which rebinding takes the system to a new state (Fig. [5|3), the 
effects of crowding can be nontrivial. Indeed, crowding-induced rebinding was predicted to contribute to 
the mechanism by which the distributive double phosphorylation mechanism present in the MAPK network 
can exhibit a processive response (Fig. |4]C) [17]. In fact, subsequent experiments on the MAPK pathway 
in mammalian cells have confirmed that crowding can indeed convert a distributive mechanism into an 
effectively processive one [77] . 



Outlook 



We have highlighted recent examples of investigations that uncover the mechanisms by which molecular 
heterogeneity can affect cellular responses. We have seen that spatiotemporal correlations introduced by 
molecular-scale structures, such as membrane domains, clusters, scaffolds, and macromolecules, can be am- 
plified by biochemical networks, leading to a changes for the entire cell. Moreover, we have emphasized that 
intrinsic spatiotemporal correlations remain present even in macroscopically homogeneous systems, due to 
the small number of reacting molecules, the finite speed of diffusion, and the crowded cellular environment, 
and that these intrinsic correlations can be just as potent at inducing cellular changes. 

Several themes emerge from our discussion. We have seen, for example, in discussing the effects of serial 
ligation, that while changing the spatial structure (e.g. by clustering) can affect the statistics of an equilibrium 
binding process, coupling to a non-equilibrium network is required to change the mean response. Further, 
we have seen that renormalization relies not only on a separation between the rapid timescale of rebinding 
and the slower timescale of network dynamics, but also on the property that rebinding returns the system 
to the same state from which it left. When rebinding takes the system to a new state, renormalization is not 
possible; instead rebinding can lead to a qualitatively different response (Fig.^. It will be interesting to see 
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how these themes inform future investigations and whether they can be sharpened into more quantitative or 
more general principles. 

Although we have focused on specific examples, the effects of molecular heterogeneity are broad in scope. 
The spatial organization of receptors and associated membrane-bound proteins is a key determinant in the 
detection of external signals by cells as diverse as bacteria [78 and immune cells ^79) . Furthermore, niultisite 
phosphorylation of the type that shapes the response of the MAPK pathway is ubiquitous in signaling 
networks, including those that govern bacterial circadian rhythms |817j . the timing of the cell cycle in yeast 
[81j . and long-term memory in the nervous system [82j . 

Further investigation of molecular heterogeneity and its functional consequences will continue to rely on 
advances in both experimental and theoretical techniques. Recent years have seen exciting advances in 
super-resolution microscopy, including the coordinated application of an excitatory laser pulse with a con- 
centric de-excitatory pulse (S3] and the stochastic activation of single fluorescent molecules (HH US]. In 
parallel, advances in particle-based modeling [15l [TBI EZ] and computational power have allowed progres- 
sively more detailed quantitative predictions to be made. It will be exciting to see what continued advances 
on both the experimental and theoretical fronts — and the fruitful exchanges between them — will bring for 
the understanding of how cellular responses are shaped at the molecular level. 
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